Phase-Aware Non-negative Spectrogram Factorization
نویسندگان
چکیده
Non-negative spectrogram factorization has been proposed for single-channel source separation tasks. These methods operate on the magnitude or power spectrogram of the input mixture and estimate the magnitude or power spectrogram of source components. The usual assumption is that the mixture spectrogram is well approximated by the sum of source components. However, this relationship additionally depends on the unknown phase of the sources. Using a probabilistic representation of phase, we derive a cost function that incorporates this uncertainty. We compare this cost function against four standard approaches for a variety of spectrogram sizes, numbers of components, and component distributions. This phase-aware cost function reduces the estimation error but is more affected by detection errors.
منابع مشابه
Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization
This study presents an automatic glottal inverse filtering (GIF) technique based on separating the effect of the glottal main excitation from the impulse response of the vocal tract. The proposed method is based on a non-negative matrix factorization (NMF) based decomposition of an ultra short-term spectrogram of the analyzed signal. Unlike other state-of-theart GIF techniques, the proposed met...
متن کاملSpectrogram Factorization Using Phase Information
Spectrogram factorization methods have been proposed for single channel source separation [1–4], audio analysis [5–8] and more recently multichannel source separation [9–11]. All spectrogram factorization approaches incorrectly assume that the mixture spectrogram is the sum of the source spectrograms. In fact, the mixture spectrogram depends on the source spectrograms and the phase difference b...
متن کاملBeyond NMF: Time-Domain Audio Source Separation without Phase Reconstruction
This paper presents a new fundamental technique for source separation of single-channel audio signals. Although nonnegative matrix factorization (NMF) has recently become very popular for music source separation, it deals only with the amplitude or power of the spectrogram of a given mixture signal and completely discards the phase. The component spectrograms are typically estimated using a Wie...
متن کاملDiscovering hierarchical speech features using convolutional non-negative matrix factorization
Discovering a representation that reflects the structure of a dataset is a first step for many inference and learning methods. This paper aims at finding a hierarchy of localized speech features that can be interpreted as parts. Non-negative matrix factorization (NMF) has been proposed recently for the discovery of parts-based localized additive representations. Here, I propose a variant of thi...
متن کاملAcoustic event recognition using dominant spectral basis vectors
This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract domi...
متن کامل